ON THE CONCENTRATION OF CERTAIN ADDITIVE FUNCTIONS 

DIMITRIS KOUKOULOPOULOS 



Abstract. We study the concentration of the distribution of an additive function, when 
the sequence of prime values of / decays fast and has good spacing properties. In particular, 
we prove a conjecture by Erdos and Katai on the concentration of f{n) — X]p|n(^'^SP)~'^ ^'^^ 
c> 1. 



1. Introduction 

An arithmetic function / : N — )■ M is called additive if f{mn) = f{m) + f{n) whenever 
{m,n) = 1. According to the Kubilius probabilistic model of the integers, statistical prop- 
erties of additive functions can be modeled by statistical properties of sums of independent 
random variables. We describe this model in the case that / is a strongly additive function, 
that is / satisfies the relation /(n) = ^p|„/(p); the general case is slightly more involved. 
Let P denote the set of prime numbers and consider a sequence of independent Bernoulli 
random variables {Xp : p G P} such that 

Prob(X„ = 1) = - and Prob(Xp = 0) = 1 - -. 

p p 

The random variable Xp can be thought as a model of the characteristic function of the event 
{n G N : p\n}. Then a probabilistic model for / is given by the random variable Yip f{p)^p- 
The above model and well-known facts from probability theory lead to the prediction that 
the values of / follow a certain distribution, possibly after rescaling them appropriately. In 
fact, the Erdos- Wintner theorem [8] states that if the series 

(1,1) M, ^ m, 53 i 

i/(p)i<i ^ i/{p)i<i ^ i/{p)i>i^ 

converge, then / has a limit distribution, in the sense that there is a distribution function 
F : M ^ [0, 1] such that 

FAu) := - — -\{n < X : f(n) < u}\ — )■ F(u) as a; — t- oo 

for every m G M that is a point of continuity of F; the characteristic function of F is given 
by 

Conversely, if / possesses a limit distribution, then the three series in ( II. ip converge. 
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One way to measure the regularity of the distribution of the set {fin) : n G N} is by 
its concentration. In general, given a distribution function G : M — [0, 1], we define its 
concentration function to be 

QG(e) = sup{G(M + e) - G{u)}. 

We seek estimates for QfS^)) or for Qf{.^) if / possesses a limit distribution. There are 
various such results in the literature, a historic account of which is given in [1]. The most 
general estimate on QfA^) is due to Ruzsa [12]. Improving upon bounds by Erdos [5] and 
Halasz [9jj, he showed that 

N ^ -1/2 

min{l, (/(p) -\\ogp)y 



;i.2) A^ + E 



p 

p<x 

This result is best possible, as it can be seen by taking f{n) = clogn or /(n) = (jj{n) = 
J2p\n However, both of these functions satisfy f{p) ^ 1. So, a natural question is whether 
it is possible to improve upon (11. 2p in the case that f{p) decays to zero. Erdos and Katai [7], 
building on earlier work of Tjan [H] and Erdos [6], showed the following result: 

Theorem 1 (Erdos, Katai [7J). Let / : N — )■ M 6e an additive function such that 
for some constants A and B. Then 

except for A and B, the implied constant depends on the implied constants in the assumptions 
of the theorem too. 

On the other hand, when f(p) <^ 1/p^, p G P, for some 5 > 0, then (II. 2p applied to //e 
yields an upper bound for Qf{^) that is never better that 1/ y^k)gIog(T7e)- 

Also, Erdos and Katai studied Qf{^) in the case that f{p) = (logp)^'^, p G P, for some 
c > 1. They showed that 

'e'/' <c QFie) <e e^/^loglog'(l/e) if c> 1, 
e < QF{e) < elog(l/e)loglog2(l/e) if c = 1, 

for < e < 1/3. Furthermore, they conjectured that when c > 1, then 

QF(e)x,eVc (o<e<l). 

The conjecture of Erdos and Katai was proven for c large enough by La Breteche and 
Tenenbaum in [Ij: 

Theorem 2 (La Breteche, Tenenbaum P). Let c > 1 and / : N — M foe an additive function 
such that \f{p) \ X (logp)"*^ for every p G P and 

\f{Pl) - f{P2)\ » iPuP2 e P, pi < P2). 
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If c is large enough, then we have that 

Q^(e)xeV'= (0<e<l); 

the implied constant depends at most on the implied constants in the assumptions of the 
theorem. 

La Breteche and Tenenbaum derived their theorem from a general upper bound on Qf{^) 
that they showed when the sequence of prime values of / satisfies certain regularity assump- 
tions. Their method uses a result from the theory of functions of Bounded Mean Oscillation, 
first introduced by Diamond and Rhoads [2J in this context to study the concentration of 
fin) = log(0(n)/n). 

In the present paper we improve upon Theorems [1] and [21 In particular, we settle the 
Erdos-Katai conjecture for every c > 1. We phrase our results in terms of the distribution 
function 

p<y ^ ^ p\n^p<y 
f{n)<u 

defined for every y >1. From a technical point of view, this function is more natural to work 
with than F^. Indeed, a calculation of the characteristic function of J-y immediately implies 
that J-y converges to F weakly, provided that the latter is well defined. It is relatively easy 
to pass from estimates for Qj^yi.^) to estimates for QfS^)- 

Our first result is Theorem |3] below, which will be proven in Section [31 For the upper bound 
in it we use a combination of ideas from |6l [7], whereas the lower bound is a straightforward 
application of Theorem 1.2 in [T]. Observe that by letting ?/ — )■ oo in Theorem [31 we deduce 
as special case£| Theorems [H and [3 

Theorem 3. Consider a additive function / : N — )■ M for which there is a set of primes V 
and a constant c G [1,2] such that 

l/(p)l<7r^ (pe^), '^nd V-<i. 

logp)'^ ^ p 

Fort>2 set 

_ sup{|/(p)|(logp)'= -.pyt^peV} 
{logty 

and assume that there is some A> 1 such that 

\fiP2) - fiPi)\ » raml ^^P'^^^'~^'\ g{p^)] ip,,p, G P, pi < p^)- 

[^ P2logP2 J 

Then for < e < 1/2 and y > K^eY^"^, where K{e) = min{n G N : n > 3, g{n) < e}, we 
have that 

1 min{^,log^} 

except for A, the implied constants depends on the implied constants in the assumptions of 
the theorem too. 

As an immediate corollary, we deduce the Erdos-Katai conjecture: 



^To deduce Theorem[Tl take = {p G P : \f{p)\ < p ^^'^^}; see also the proof of Theorem [4] in pagefTOl 
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Corollary 1. Let c > 1 and / : N — t- M 5e an additive function with f{p) = {\ogp) for all 
p eF. For < e < 1/2 we have that 

e'/' < Qrie) < min { ^r^y. log \ } e'^'- 

Proof. This follows by applying Theorem |3] with min{c, 2} in place of c, A = 1, and y > 
exp{10e~^/'^}, and then letting ?/ — )■ oo. □ 

Remark 1.1. When < c < 1, the behavior of Qp for / as in Corollary [T] is different. As 
Gerald Tenenbaum has pointed out to us in a private communication, in this case we have 
that 

(1.3) QF(e)x,e (0<e<l). 

Corollary [1] and relation ( II. 3p give the concentration of an additive function / with f{p) = 
(logp)"'^, p G P, for all positive values of c except for c = 1, which is the only case remaining 
open. 

Remark 1.2. There is a heuristic argument which motivates Theorem |3l Assume for sim- 
plicity that the sequence {|/(p)|(logp)'^ : p G P} is decreasing so that the assumptions of 
Theorem in] are satisfied with g{p) = \f{p)\ for all p G P. For every integer n we have that 



(logn)'^ 

p\n,p>K{e) p\n,p>K{e) ^ 

Since for a typical integer n the sequence {loglogp : is distributed like an arithmetic 
progression of step 1 [IDl Chapter 1], we find thatO 

p|n,p>_ft'(e) j>loglog_fi:(e) 

So only the prime divisors of n lying in [l,i^(e)) are important for the size of Qp[e). Note 
that for a prime number p < K{e) we have that \f{p)\ > e. Therefore if a and b are 
composed of primes from [l,i^'(e)], then it is reasonable to expect that \f{a) — f{b)\ is big 
compared to e, unless a and b have a large common factor. This leads to the prediction 
that Qf{^) ~ l/log-^(e); which is confirmed by Theorem [3] when c > 1. However, when 
c < 1 this heuristic fails, as (11. 2p shows, and the underlying reason is combinatorial: The 
pigeonhole principle implies the lower bound Qf{^) e for the concentration function of 
any distribution function F (see also |7], Remark 1, p. 297]). 

Finally, our methods yield the following strengthening of Theorem (H which will be proven 
in Section [31 

Theorem 4. Let / : N — )■ M 6e an additive function for which there exists positive constants 
rj, A and B such that 

Y and y 1< t^-^ (t > 1, M G M). 

^-^ p t 

p>t^ p<t 

u<f{p)<u+l/tB 



^The symbol here is used in a non-rigorous fashion to denote 'roughly less than'. Similarly, the symbol 
i' means 'roughly equal to'. 
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Then for y > e~^^^ we have 

log(l/e) 

except for T], A and B, the implied constant depends on the implied constants in the assump- 
tions of the theorem too. 

Notation. For an integer n we denote witii P~^{n) and P~{n) its largest and smallest prime 
factors, respectively, with the notational convention that P~^{1) = 1 and -P~(l) = oo. 

2. The main results 

In this section we state two general results and derive Theorems [3] and H] from them in 
Section [31 The first one of them. Theorem O is novel, whereas the second one. Theorem El 
is a corollary of Theorem 1.2 in [I]. 

Theorem 5. Let / : N — )■ M &e an additive function for which there is a set of primes V 
and a decreasing function Pj : (0, 1] — ?■ [2, +oo) such that 



« 1, 

(2.2) |/(p)|<e (0<e<l,pGP,p>P/(e)). 

Furthermore, assume that there is some A G (0, 1] and some p > 1 such that for all m G M, 
< e < 5 < 1 and 1 < z < w < min{Pj((5), Pf{t)^} we have the estimate 

(2 3) T ^/->(Ven 

\e/^ + log-(. + l) else. 

u<f{p)<u+e 

Let < e < S < 1 such that Pf{S) < Pf{e)^, set qj = Pf{2^5) for j > 0, and consider 
J G {0} U {j G N : 2^ < 1/6 and qj > {2^/e)P}. For y > qo we have that 

^^■^^ ^^''^'^<<i^ + ^|j2il^ + ^|j2^lf^' 

the implied constant depends at most on the implied constants in (12. ip and (12. 3p . 

Remark 2.1. The parameter A and the set V are introduced to make Theorem O more 
applicable. One can think of Pf defined by -P/(e) = max{p G P : \ f{p)\ > e}. Condition (12. 3p 
can be motivated as follows. Assume that 

(2.5) y ^^<r7 {0<r]<l). 

p>Pf{v) 

We have that \f{p)\ ^ 5 for p G {Pf{26), Pf{6)]. So if the sequence {f{p) : p G P} is 
'well-spaced', then we expect that 



V -<- V -~- V \iM<<i 

^ p^ b ^ p^ P ^ p 5' 

Pf{25)<p<Pf{5) ^ Pf{2S)<p<Pf(5) ^ Pf{25)<p<Pf{5) ^ 

w<f{p)<w+e 



by (ESD. 
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Proof of Theorem\^ For w > zvfe denote with ^(2;, w) the set of integers all of whose prime 
factors belong to P fl (z, w\. For every -u G M we have that 

E-^E-E- E - 

^ — ^ n ^ — ^ rii ^ — ^ no ^-^ ns 

P+{n)<y p|ni=>pGP\P n2&&'{qo,y) 136^(1,90) 

u<f{n)<u+t u-f{nin2)<f{n3)<u-f{nin2)+e 

(2.6) ~ ^ Ti^^n^^eMl ^ 

\v<f{n3)<v+e 

^ log?/ I ^ 1 
< sup < > — 

log go ^GK , ^3 

v<f{n3)<v+e 

by (EH). 

Next, fix f e M and let 

A/" = {n G ^(1, go) : ^ < fin) < v + e, p > qj ^ ] n}. 

Clearly, 

(2.7) 5: uj: j: i«5:!2i*«j2i^</\iog,„ 

^ ^ ^ n ^ ^ n ^ »2 gjloggj | — if J > 1, 

ne^(l,go)W P>gjP+(n)<qo P>9J ^ 'iJ to 'iJ 1^52'^loggj ~ 

f</(")<f+e p^|n 

which is admissible in both cases. So it suffices to bound X^riGArl/"^ from above. 

We partition M into certain subsets and estimate the contribution of each one of them to 
XlneAT V""- separately. Let Aq = ^(1, gj). Next, given n E J\f\ A/q we write n = apb, where 
p is prime and P'^ia) < qj < p < P~{b). For 1 < j < J let Afj be the set of n G A/" with 
qj < P ^ First, we bound XlneM) '^l'^- Writing n = mP^(n) = mp', we find that 

i^gM) -P+(m)<gj p'G-Pn{P+(m),gj] 

(^2.8) «-/M</(p')<''-/'M+e 

^ \ . eloggj 

^ + 2^ m ^ log2(l + P+(m)) J ^ 

by (12. 3p with w — f{m), 2"^ 6, P^{m) and gj in place of m, 5, z and w, respectively. Next, we 
bound YlneNj ^/n ioi j E {1, . . . , J}. We have that 

El ^ 1 V 1 1 V 1 

n ~ h ^ ai ^ a2 ^ p 

v-f{aib)-f{a2)<f{p)<v^f{aib)^f(a2)+e 



(2.9) 



< 



E 1 E ;7-^^Pi E ;^ E 



1 



b&-^{qj,qo) a^€.9'(q^,qj) \ a2&^{hq)) p&Vn{qj,qj-i] 

w-fia2)<f{p)<w-f{a2)+e 
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Fix some w G M and consider 02 G ^{l, Qj) and p eV H {qj, qj-i] with w < /(a2) + f{p) < 
w + e, as above. Since \f{p)\ < 2-^5 for j9 G P fl (gj,+oo), by (12. 2p . we must have that 



/(aa) =w; + 0(2^5). So 

E - 



02 



E 



1 



E 



1 

a2 



E 



(2.10) 



w-f{a2)<f{p)<w-f{a2)+e f{a2)=w+0{2^&) w-f(a2)<f{p)<w-f{a2)+t 

^2^6 ^ 02' 

a2e.52{l,qr^) 
f{a2)=w+0{2U) 



1 



by the first part of (12. 3p apphed with w — f {0.2)1 2-'~^5, qj and in place of m, 5, z and 
w, respectively, since q^ > qj > {2'^5/e)P > {2^/e)P. Finally, writing 02 = mP+(a2) = mp', 
we find that 



/(a2)=«)+0{2J5) 

For every m G N we have that 

1 



E - = o(i)+ E - E 

^ a2 ^ m ^ 



m ^ — ' p' 

e,9f(l,g)) p'£Vn{P+(m),q)] 

f{p')=w-f{m)+0{2U) 



E :7 ^ 

p'G-Pn(P+(m),g}] 
/(p')=w'~/M+0(2^'5) 



E 



2^6 
> < :7TT + 



p'G-Pn(P+(m),g}] 



2^5 log2(l + P+(m))^ 



I, y</(p')<J/+2^5 

by (12. 3p with y, 2-'5, 2"^(5, P^{m) and in place of m, e, 5, z and w, respectively. So we 
find that 



^ , 02^"^^ ^ m V2-^5 ^ log^(l + P+(m)) j'^ 



026^3^(1,9;)) 
/(a2)=-!«+0(2-',5) 

Combining the above estimate with (12. 9p and (I2.10p implies that 



A log qj 



E^«i^(i 



A log qj 

2J-j 



E ^ E 

be.9'(qj,qo) aie^?^{q),qj) ^ 



Wog qj\ log go 



2-^--'' 7 Alogg/ 



□ 



which together with relations (12. 6p . (12. 7p and (12. 8p completes the proof. 

As we mentioned above, the next theorem is a corollary of Theorem 1.2 in [1]. Notice also 
its similarity with Theorem 2 in [7]. 

Theorem 6. Let / : N — i- M 6e an additive function and < e < 1. // there is a set of 
primes V and some M >2 such that 

and V 1^(^)1 

p&\V ^ 

then for y > M we have that 



p(^'P,p>M 
1 



p 



< e. 



logM' 
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the implied constant depends at most on the implied constants implicit in the assumptions of 
the theorem. 

Proof. Without loss of generality, we may assume that M is big enough. Let 

e p 

Define ^ : N ^ M by 

f{n) if p G [1, y]r[V for all primes p|n, 
else, 

and call G its distribution function. Then Theorem 1.2 in [1] yields that 



9[n) 



QcA^C.) > (i - 2 . H + o(i)) n (i - i) » (M oo). 



So, by the pigeonhole principle, we deduce that 
(2.11) CM.)»^»c ' 



C + 1 " logM 
Finally, we have that 

ogw=bup n (i-^) E ^ n (i-^)"«<3-.w. 

\ u<f{n)<u+e ) 

which together with (12. lip completes the proof of the theorem. □ 

3. Proof of Theorems [3] and H] 

We conclude the paper by showing how to deduce Theorems [3] and H] by the results of 
Section H 

Proof of Theorem 0. Before we delve into the details of the proof, we note some properties 
of the functions g and K. The definition of g implies that the function t — t- g{t){logtY is 
decreasing and, in particular, that g is strictly decreasing. Also, for p E V with p > K{e) 
we have that 

(3.1) |/(P)| < «(P) < s(m)mW ^ e(log(A-(.)))= 



Finally, we claim that 

(3.2) log(K(5)-l)>i(l)'^'log(K(e)-l) {0<e<6<l). 

Indeed, if K{S) = K{e), this holds trivially, and if K{6) < K{e), then we have that 
^ ^ gjKie) - mogiKje) - 1))^ ^ e(log(ir(6) - 1))^ ^ e(log(ir(6) - 1))^ 



g{K{6)){\ogK{6)y " 6{\ogK{6)Y - 6{2\og{K{6) - 1)^ 

since K{e) > 3 for all e G (0, 1]. In any case, (13. 2^ holds. 

Now we are ready to proceed to the proof of the theorem. First, note that the hypothesis 
of Theorem [6] is satisfied with M = K{e) and V, by (13. ip . and the desired lower bound 
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follows. For the upper bound we show that we may apply Theorem [3] with Pf = K — 1, 
V, A = 1/A and p = 2. Condition (12. ip holds by assumption and condition (12. 2 p follows 
immediately by (13. ip . Lastly, we show (12. 3 p with X = 1/A and p = 2. First, we prove some 
intermediate estimates. Let 0<e<(5<l,MG]R and 3 < z < w < min |Py((5), P/-(e)^/'^} 
with z > w^l'^. By assumption, there is an absolute constant C > such that 

- /(P2)| > ^min I^M^l^,^ < e P). 

We claim that 

{ min{e/5, 1/ log w} if z > 

(3.3) ^ - < i min{e/5, 1/ log w} log(25/e) + Ijz \i z < {bjefl^, 

pevn(zM P minieM, 1/ log w} + log'^(z + 1) if z < 

u< f (p) <u+min{e, S/ log w}/C ^ L/'/toJitoV'y V/; 

Indeed, if pi < p2 both belong to the set {p G Vn{z, w] : u < f{p) < M+min{e, 6/ log w}/C}, 
then we have that 
1 
C 



(3.4) 



mm <^ e, \ > \ f{pi) - /(ps)! > — mm <^ ,g[p^ ) ^ 

I log w J '-'I P2logp2 J 

2 . fS{p2-pi) 
> — mm 



C { P2logP2 

and, consequently, 

n / ^ P2logP2 . Je 1 \ P2 

0<P2-Pi< ^ mm <^ -, ^ < — . 

2 l^d logwj 2 

Relation (13. 3 p then follows, by the Brun-Titchmarsch inequality [131 Theorem 9, p. 73] for 
the first and the second part, and by the Prime Number Theorem [131 Theorem 1, p. 167] for 
the third part. Next by breaking the interval {u, u+e] into at most 1+C max{l, e{\ogw)/6} < 
1 + C log w intervals of the form (f , v + min{e, 6/ log w}/C], we deduce that 

^ f e/5 if z > 

(3.5) Yl - « 1 (^/^) log(We) + (log z)/z if z < (5/e)'/^ 

P^vn{zM P \e/6 + log-2 ;z if ^ < (5/6^/^. 

n<f{p)<u+e K / ' S, \ I J 

We are now in position to show (12. 3p : consider 0<e<5<l,MG]R and 3 < z < w < 
min {P/(^), P/(e)i/^}. Note that for all j > with 2^ < 1/5 we have that 

\ogPf{2^) > 2-^-^ log Pf (6) > 2-^-^/^\ogw > A-Hogw, 

by (13.21) . Consequently, relation (13. 5p with 2^6, max{z,w^ ' ^} and w'^ ' in place of 5, 2; and 
w, respectively, implies that 

if z > (5/e)2 and 2^- < V^, 



peT'n(2,i«] 

u<f(p)<u+e 



log ( — ) + if^>(5/e)'and2^ >^. 



^ 1 e , /2^(5\ 4-^logti; 

e le-' 
2-Jd (logw)"' 



Summing the above inequality over j > with 2^ < 1/6 and w'^ ^ > z completes the proof 
of (12. 3p . In conclusion, we may apply Theorem O with Pf = K — 1, p = 2 and A = 1/A. 
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Consider < e < 1/2 small enough so that K(e) > 3^ and 5 = g ([A'(e)^/^J) lies in 
[e, 1/2]. Since g is strictly decreasing, the definition of K implies that 

(3.6) m = [K{ey/^\ . 

In particular, Pf{6) < P/(e)i/^. For j e Z with 1 < 2^' < 1/6 we set qj = Pf{2^) = 
K(2^) - 1. Note that 

(3.7) logg, >2-i-(^-*)/^logg, (0 < « < j, 2^' < 1/5), 
by (1321). Set 

J = max ({0} U {j G N : 2^ < l/S and qj > . 
Then Theorem [S] and relation 03.71) imply that for y > K^ey^"^ > go 

log go ^ Tit 2^ " ^ ^ <ij 

1 e 1 e log gj 

(3 8) ^ ^ ^ ^ 2^"'^' 90 ^ 5 ^ 2-^-^/'= log go 

min{^,l + Je/5} e loggj 



•C 



log go (5 2-^(i-i/c)loggo 

min{^,l + Je/(5} e log gj 



log go (5max{l, (J+ l)(c - l)}loggo' 

Finally, note that if 2"^+^ < 1/5, then the maximality of J and (13. 7p imply that 

loggj < 41oggj+i < 8 log(2-^+i5/e) < J + 1 + log(5/e). 

On the other hand, if 2-^+^ > 1/5, then gj < ^^(1/2) < 1, since g{t) < (logt)""". In any 
case, we find that loggj ^ J + 1 + log(5/e), which together with the inequality 

Je elog(l/5) , 1 
5 e 

and relations (13. 6 p and (13. 8p completes the proof of the theorem. □ 

Proof of Theorem ^ The lower bound is an immediate consequence of Theorem El For the 
upper bound, note that 

|/(p)|>p-i/2A ^ p ^ k>l 2''— i<p<2fe ^ fc>l 

So if we set V = {p : \f{p)\ < p-^s^}^ p^(e) = ^~2A^ ^ _ 1/(2A5), and p = 1/r], then the 
assumptions of Theorem [5l are satisfied. Applying this theorem with 6 = e^/^^-S ^^^^ J = 0, 
we deduce that 

X 6 1 

log(l/5) 5 log(l/e) 
which completes the proof. □ 
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